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AMENDMENT TO THE CLAIMS 

1 . (Currently Amended) A method of determining similarity between words, comprising: 
receiving as an input a first word and a first dependency structure that includes the first 

word; 

receiving a data structure indicative of a second word and a second dependency structure 

that includes the second word; aftd 
selecting one of a plurality of different weighting measures to weight a similarity measure 

based on whether a frequency indicator indicative of a frequency of occurrence of 

the second dependency structure in training data meets a frequency threshold 

value; and 

calculating the similarity between the first and second words based on ar -the similarity 
measure weighted with the selected weighting measure, w e ight e d based on a 
fr e quency indicator indicativ e of a frequency of occurr e nc e of the s e cond 
d e p e nd e ncy structur e in training data . 

2. Canceled. 

3. (Currently Amended) The method of claim 2-l_wherein the plurality of weighting 
measures includes a co-occurrence weighting measure and a mutual information (MI) weighting 
measure. 

4. (Currently Amended) The method of claim 3 wherein weighting the similarity measure 
comprises: 

weighting the similarity measure with the co-occurrence frequency measure if the 
frequency indicator indicates the frequency of occurrence is below the a-frequency 
threshold value. 
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5. (Currently Amended) The method of claim 4 wherein weighting the similarity measure 
comprises: 

weighting the similarity measure with the MI measure if the frequency indicator indicates 
the frequency of occurrence is above the frequency threshol d value - 

6. (Currently Amended) The method of claim 2-lwherein receiving a data structure 
indicative of a second word comprises: 

accessing a data store that stores records that include words and associated dependency structures 
and frequency indicators. 

7. (Original) The method of claim 6 wherein the associated dependency structures and 
frequency indicators in the data store are stored as vectors associated with the words, and 
wherein accessing a data store comprises: 

accessing the words and associated vectors. 

8. (Original) The method of claim 6 wherein accessing the data store comprises: 
identifying candidate words in the data store by reducing the search space of records in the data 

store. 

9. (Original) The method of claim 8 wherein identifying candidate words comprises: 
accessing a lexical knowledge base to identify possible candidate words in the data store. 



10. 


Canceled. 


11. 


Canceled. 


12. 


Canceled. 



13. (Currently Amended) A natural language processing system, comprising: 

a data store storing head words and associated attributes, each of the attributes including a 
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related word that was related to the head word in a training corpus, a relation type 
indicator indicating a type of relation between the head word and the related word, 
and a frequency indicator indicative of a frequency with which the attribute 
occurred relative to the head word in the training corpus; and 
a similarity generator configured to receive an input word and an associated input 
dependency structure and to access the data store and calculate a similarity 
between the input word and head words in the data structur e store based on the 
input word and associated input dependency structure and the head words and 
associated dependency structures using a similarity measure that weights a 
similarity corresponding to a given head word bas e d on with a first weighting 
measure if the frequency indicator associated with the given head word meets a 
predetermined frequency threshold value and with a second weighting measure 
different from the first weighting measure, if the frequency indicator does not 
meet the predetermined frequency threshold value . 

14. Canceled. 

15. Canceled. 

1 6. (Currently Amended) The system of claim 4-5-13 wherein the similarity generator is 
configured to select a co-occurrence frequency weighting measure if the frequency indicator is 
below a-the predetermined threshold value . 

17. (Currently Amended) The system of claim 16 wherein the similarity generator is 
configured to select a mutual information weighting measure if the frequency indicator is above 
the predetermined threshold value . 

18. (Original) The system of claim 13 and further comprising: 

a lexical knowledge base, the similarity generator being configured to access the lexical knowledge 
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base to identify a subset of the head words in the data store as candidate words prior 
to calculating the similarity. 

19. (Original) The system of claim 13 wherein the data store stores the attributes as vectors. 

20. (Currently Amended) A system for g e n e rating annotat e d data for us e in calculating 
similarity between words using annotated data , comprising: 

a parser configured to receive a textual input and parse the textual input into dependency 
structures including words and relation types indicative of relations between the 
words in the textual input and generate a vector corresponding to each dependency 
structure, the vector including a related word, a relation type indicator, and a 
frequency indicator indicating a frequency with which the dependency structure 
occurred in the textual input; and 

a data store configured to store the words and corresponding vectors regardless of the 

frequency with which the dependency structures occurred in the textual inputr ; and 

a similarity generator configured to receive an input word and an associated input 
dependency structure and to access the data store and calculate a similarity 
between the input word and words in the data store based on the input word and 
associated input dependency structure and the words and associated dependency 
structures in the data store using a similarity measure that weights a similarity 
corresponding to a given word in the data store with a first weighting measure if 
the frequency indicator associated with the given word meets a predetermined 
frequency threshold value and with a second weighting measure, different from 
the first weighting measure, if the frequency indicator does not meet the 
predetermined frequency threshold value. 

21. (Original) The system of claim 20 wherein the frequency indicator comprises a normalized 
count value. 
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22.(Original) The system of claim 20 wherein the parser is configured to parse the textual input 
into dependency triples. 



